Multiscale Geometric Methods for Data Sets I: Multiscale SVD, Noise and Curvature

نویسندگان

  • Anna V. Little
  • Mauro Maggioni
  • Lorenzo Rosasco
چکیده

Large data sets are often modeled as being noisy samples from probability distributions μ in R, withD large. It has been noticed that oftentimes the supportM of these probability distributions seems to be well-approximated by low-dimensional sets, perhaps even by manifolds. We shall consider sets that are locally well approximated by k-dimensional planes, with k ≪ D, with k-dimensional manifolds isometrically embedded in R being a special case. Samples from μ are furthermore corrupted by D-dimensional noise. Certain tools from multiscale geometric measure theory and harmonic analysis seem well-suited to be adapted to the study of samples from such probability distributions, in order to yield quantitative geometric information about them. In this paper we introduce and study multiscale covariance matrices, i.e. covariances corresponding to the distribution restricted to a ball of radius r, with a fixed center and varying r, and under rather general geometric assumptions we study how their empirical, noisy counterparts behave. We prove that in the range of scales where these covariance matrices are most informative, the empirical, noisy covariances are close to their expected, noiseless counterparts. In fact, this is true as soon as the number of samples in the balls where the covariance matrices are computed is linear in the intrinsic dimension of M. As an application, we present an algorithm for estimating the intrinsic dimension ofM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiscale geometric modeling of macromolecules I: Cartesian representation

This paper focuses on the geometric modeling and computational algorithm development of biomolecular structures from two data sources: Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB) in the Eulerian (or Cartesian) representation. Molecular surface (MS) contains non-smooth geometric singularities, such as cusps, tips and self-intersecting facets, which often lead to computationa...

متن کامل

Multiscale Estimation of Intrinsic Dimensionality of Data Sets

We present a novel approach for estimating the intrinsic dimensionality of certain point clouds: we assume that the points are sampled from a manifold M of dimension k, with k << D, and corrupted by D-dimensional noise. When M is linear, one may analyze this situation by SVD: with no noise one would obtain a rank k matrix, and noise may be treated as a perturbation of the covariance matrix. Whe...

متن کامل

Multiscale Fusion for Seismic Geometric Attribute Enhancement

In this abstract, we propose a multiscale fusion technique to enhance seismic geometric attributes, such as dip and curvature, which are very sensitive to noise present in seismic data. For a give seismic section, first, we construct a Gaussian pyramid that allows us to generate the seismic attribute at different resolutions (scales). Then, all attributes at the different scales are fused toget...

متن کامل

A Novel Noise-Robust Texture Classification Method Using Joint Multiscale LBP

In this paper we describe a novel noise-robust texture classification method using joint multiscale local binary pattern. The first step in texture classification is to describe the texture by extracting different features. So far, several methods have been developed for this topic, one of the most popular ones is Local Binary Pattern (LBP) method and its variants such as Completed Local Binary...

متن کامل

Dynamics of Macro–Nano Mechanical Systems; Fixed Interfacial Multiscale Method

The continuum based approaches don’t provide the correct physics in atomic scales. On the other hand, the molecular based approaches are limited by the length and simulated process time. As an attractive alternative, this paper proposes the Fixed Interfacial Multiscale Method (FIMM) for computationally and mathematically efficient modeling of solid structures. The approach is applicable to mult...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012